Goto

Collaborating Authors

 K-12 Education



LAUSD bans screen time before the second grade, among the strictest policies in the nation

Los Angeles Times

Things to Do in L.A. Tap to enable a layout that focuses on the article. Fifth grade students work on computers at their South Los Angeles school in 2019. This is read by an automated voice. Please report any issues or inconsistencies here . Los Angeles Unified will ban classroom screen time in preschool through first grade and sharply limit it for older students.


AI in the classroom prompts tide of concern from US parents and experts

The Guardian

'There is this overwhelming sense that ed tech companies are deciding what kids learn, and teachers are just being put into this position of tech support instead of driving the decisions about what is best for kids in terms of learning.' 'There is this overwhelming sense that ed tech companies are deciding what kids learn, and teachers are just being put into this position of tech support instead of driving the decisions about what is best for kids in terms of learning.' In October, Kelly Clancy's son received an assignment in sixth grade at a middle school in Brooklyn, New York, to create a science experiment and then ask Google Gemini, an artificial intelligence chatbot, for feedback, she said. Clancy, who has three children in New York City public schools, told the teacher that the bot "is something that just teaches kids that they can have machines do the thinking for them", instead of suggesting: "Let's talk to your partners. What about the science experiment could you improve?" Clancy also founded Parents for AI Caution in Educational Spaces, a group pushing the city to institute a two-year moratorium on using AI in its public schools.


Carvalho resigns as LAUSD superintendent amid federal investigation

Los Angeles Times

Things to Do in L.A. Tap to enable a layout that focuses on the article. Alberto Carvalho, who resigned Sunday as LAUSD superintendent, addresses students at an elementary school in 2023. This is read by an automated voice. Please report any issues or inconsistencies here . Alberto Carvalho resigned Sunday night.


Norway imposes broad restrictions on AI for elementary school kids

Engadget

This follows a smartphone and tablet ban in classrooms. Norway is imposing a strict ban on the use of generative AI tools by elementary school kids, according to a report by . Prime Minister Jonas Gahr Stoere suggested at a press conference that AI lets children skip crucial steps in their education and that schools should focus on teaching them how to read, write and do mathematics. These standards will be imposed at the start of the new school year, which begins in late August. However, the policy also extends to teens, albeit in a reduced fashion.


MemSim: ABayesian Simulator for Evaluating Memory of LLM-based Personal Assistants

Neural Information Processing Systems

LLM-based agents have been widely applied as personal assistants, capable of memorizing information from user messages and responding to personal queries. However, there still lacks an objective and automatic evaluation on their memory capability, largely due to the challenges in constructing reliable questions and answers (QAs) according to user messages. In this paper, we propose MemSim, a Bayesian simulator designed to automatically construct reliable QAs from generated user messages, simultaneously keeping their diversity and scalability. Specifically, we introduce the Bayesian Relation Network (BRNet) and a causal generation mechanism to mitigate the impact of LLM hallucinations on factual information, facilitating the automatic creation of an evaluation dataset. Based on MemSim, we generate a dataset in the daily-life scenario, named MemDaily, and conduct extensive experiments to assess the effectiveness of our approach. We also provide a benchmark for evaluating different memory mechanisms in LLM-based agents with the MemDaily dataset.


The Right to Red-Team: Adversarial AILiteracy as a Civic Imperative in K-12 Education

Neural Information Processing Systems

The increasing societal integration of Large Language Models (LLMs) and agentbased AI demands a new civic competency: adversarial reasoning. This position paper argues that K-12 AI education must move beyond passive literacy to actively equip students with skills in responsible adversarial prompting and ethical system "hacking." Such capabilities are essential for citizens to critically probe AI systems, understand their inherent limitations, identify manipulative patterns, and hold them accountable. We posit that cultivating a generation skilled in "red-teaming" AI is vital for maintaining transparency, preventing undue influence, and fostering a democratic engagement with these transformative technologies.


MyoChallenge 2024: ANew Benchmark for Physiological Dexterity and Agility in Bionic Humans

Neural Information Processing Systems

Recent advancements in bionic prosthetic technology offer transformative opportunities to restore mobility and functionality for individuals with missing limbs. Users of bionic limbs, or bionic humans, learn to seamlessly integrate prosthetic extensions into their motor repertoire, regaining critical motor abilities. The remarkable movement generalization and environmental adaptability demonstrated by these individuals highlight motor intelligence capabilities unmatched by current artificial intelligence systems. Addressing these limitations, MyoChallenge'24 at NeurIPS 2024 established a benchmark for human-robot coordination with an emphasis on joint control of both biological and mechanical limbs. The competition featured two distinct tracks: a manipulation task utilizing the myoMPL model, integrating a virtual biological arm and the Modular Prosthetic Limb (MPL) for a passover task; and a locomotion task using the novel myoOSL model, combining a bilateral virtual biological leg with a trans-femoral amputation and the Open Source Leg (OSL) to navigate varied terrains. Marking the third iteration of the MyoChallenge, the event attracted over 50 teams with more than 290 submissions all around the globe, with diverse participants ranging from independent researchers to high school students. The competition facilitated the development of several state-of-the-art control algorithms for bionic musculoskeletal systems, leveraging techniques such as imitation learning, muscle synergy, and model-based reinforcement learning that significantly surpassed our proposed baseline performance by a factor of 10. By providing the open-source simulation framework of MyoSuite, standardized tasks, and physiologically realistic models, MyoChallenge serves as a reproducible testbed and benchmark for bridging ML and biomechanics.


Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning

Neural Information Processing Systems

Large Language Models (LLMs) are increasingly used to simulate human users in interactive settings such as therapy, education, and social role-play. While these simulations enable scalable training and evaluation of AI agents, off-the-shelf LLMs often drift from their assigned personas, contradict earlier statements, or abandon role-appropriate behavior. We introduce a unified framework for evaluating and improving persona consistency in LLM-generated dialogue. We define three automatic metrics--prompt-to-line consistency, line-to-line consistency, and Q&A consistency--that capture different types of persona drift and validate each against human annotations. Using these metrics as reward signals, we apply multiturn reinforcement learning to fine-tune LLMs for three user roles: a patient, a student, and a social chat partner. Our method reduces inconsistency by over 55%, resulting in more coherent, faithful, and trustworthy simulated users.


HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Neural Information Processing Systems

Preference datasets are essential for training general-domain, instruction-following language models with Reinforcement Learning from Human Feedback (RLHF). Each subsequent data release raises expectations for future data collection, meaning there is a constant need to advance the quality and diversity of openly available preference data. To address this need, we introduce HelpSteer3-Preference, a permissively licensed (CC-BY-4.0),